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This paper develops the Jungle model in a credit portfolio framework. The Jungle model is able to 
model credit contagion, produce doubly-peaked probability distributions for the total default loss and 
endogenously generate quasi phase transitions, potentially leading to systemic credit events which happen 
unexpectedly and without an underlying single cause. We show the Jungle model provides the optimal 
probability distribution for credit losses, under some reasonable empirical constraints. The Dandelion 
model, a particular case of the Jungle model, is presented, motivated and exactly solved. The Dandelion 
model provides an explicit example of doubly-peaked probability distribution for the credit losses. The 
Diamond model, another instance of the Jungle model, experiences the so called quasi phase transitions; 
in particular, both the U.S. subprime and the European sovereign crises are shown to be potential 
examples of quasi phase transitions. We argue the three known sources of default clustering (contagion, 
macroeconomic risk factors and frailty) can be understood under the unifying framework of contagion. 
We suggest how the Jungle model is able to explain a series of empirical stylized facts in credit portfolios, 
hard to reconcile by some standard credit portfolio models. We show the Jungle model can handle 
inhomogeneous portfolios with state-dependent recovery rates. We look at model risk in a credit risk 
framework under the Jungle model, especially in relation to systemic risks posed by doubly-peaked 
distributions and quasi phase transitions. 
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1. Introduction 


Clustering of corporate defaults is relevant for both macroprudential regulators and banks’ senior 
management. With a robust modelling of credit losses, macroprudential regulators may analyse and 
manage the risk of systemic events in the economy, and banks’ senior management may compute 
the capital needs out of their core credit portfolios. 

Historical corporate default rate data, as described in (Moody’s Investors Service 2011) and 
( |Giesecke et al. 2011), signal the sensitivity of credit defaults to systemic events in the economy, 
from the Great Depression and the 2007-2009 Great Recession, to the savings and loans crisis and 
the burst of the dotcom bubble, as it can be seen from Figure 1. 



Year 


Figure 1. Historical default rates for Global Speculative-Grade bonds, from (Moody’s Investors Service|[2011) 


Standard credit portfolio models were not able to model the tail risks in credit portfolios when 
the U.S. subprime and the Spanish Real Estate bubbles bursted. Some of these models introduce 
default correlations through the dependency of the probabilities of default on macroeconomic fac¬ 
tors describing the state of the economy. As a consequence, when the state of the economy is 
” good”, the probabilities of default tend to go down. Conversely, when the state of the economy is 
’’bad”, the probabilities of default tend to go up. Averaging over the business cycle induces default 
clustering. 

However, these predicted default correlations tend to be low in comparison to empirical evidence, 
and the corresponding probability distribution of the losses shows ” thin tails”. It is widely accepted 
that in addition to the dependence on macroeconomic risk factors, a reasonable credit risk model 
should include contagion effects, too. 

Contagion effects should often give rise to doubly-peaked probability distributions for the credit 
losses, with the first peak being close to the peak of an equivalent binomial distribution (when 
contagion effects are weak, and the defaults can be considered as roughly independent, which is 
usually the case when the state of the economy is ’’good”) and a second peak, at higher losses, 
corresponding to avalanches / domino effects of credit defaults due to contagion. 

This paper has the purpose to show a particular class of credit risk model, the Jungle model Q 


lr The name ” Jungle” provides intuition for the complex network of dependencies among the constituents of a credit portfolio. 
Moreover, since the lion is the King of the Jungle, we will see the Dandelion model (from the French ” dent-de-lion”, or lion’s 
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is able to: 

(i) Model contagion among borrowers 

(ii) Endogenously generate doubly-peaked probability distributions for the credit losses. As 
opposed to the case of single-peaked probability distributions, for which higher credit losses 
are always less likely than lower losses (at the large loss regime), doubly-peaked probability 
distributions show the distressing phenomenon that very large losses may be more likely to 
happen than moderately large losses 

(iii) Show how credit systemic events may occur suddenly and unexpectedly. A credit portfolio 
may inadvertently cross a ” quasi phase transition point”, and its collective behaviour change 
all of a sudden, potentially creating systemic events. We want to emphasize that intuition 
usually tells us a systemic crisis requires a strong single cause originating it; however, this 
is not necessarily true. We will show a systemic crisis can be created without a strong 
underlying, single cause, and we will learn how to recognize those ’’quasi phase transition 
points” 



Figure 2. A general Jungle model 

Section 4 presents the Jungle model and shows the Jungle model is the optimal probability 
distribution for modelling losses in a general credit portfolio, under two assumptions: 

(i) The Maximum Entropy principle (to be described in Section 3) is the right guiding principle 
to select the probability distribution of losses in the framework of credit risk modelling 

(ii) All the empirical information of a given credit portfolio can be summarized as probabilities 
of default and default correlations of its constituents 

In Section 4, we restrict to (close to) homogeneous portfolios, binomial default indicators and no 
modelling of recovery rates, without loss of generality as discussed in Section 7. 

Section 5 tries to motivate the use of the Jungle model. In particular, we show that when there is 
no empirical information available on default correlations, the Jungle model becomes the binomial 
distribution (as it should). We also introduce contagion perturbatively around the binomial model, 
and we show the behaviour of the corresponding interacting system is the one we would expect 
intuitively. 


tooth) is the King of the Jungle of contagion models, since the Dandelion may describe the contagion arising from banks, which 
are the main source of systemic risks. 
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Section 5 continues with the presentation of a two particular cases of the Jungle model, the 
Dandelion model and the Diamond model, both of them are interacting models through contagion. 




Figure 3. A Dandelion and a Diamond 


The Dandelion model assumes a central element in the credit portfolio is connected through 
contagion with the rest of the nodes in the portfolio, and no other pair of nodes is connected. 
Intuitively, the Dandelion model mimics the relationship between a bank and its many borrowers, 


or even between a Central Bank with the rest of the economy, see (Bundesbank 2011). 


We show the Dandelion model displays a doubly-peaked loss distribution, endogenously generated 
through contagion. We also find the results of this model can be interpreted as an endogenously 
generated two-valued mixture model: the two states of the central node can be understood as the 
two states of the economy, with the probability of default in the ’’bad” state of the economy being 
higher than the probability of default in the ’’good” state of the economy, by an amount given by 
the variable representing contagion. In a sense, the Dandelion model provides a unifying way to 
think about both contagion and macroeconomic risk factors. 

We argue the Diamond model experiences a quasi phase transition for a not unreasonable set of 
empirical parameters, showing quantitatively that a small change in the empirical data may result 
in significant changes for the profile of the probability loss distribution, leading to severe systemic 
risks. There is a pictorial analogy with the phase transition of water into steam: if we increase one 
degree Celsius the temperature of water at 98 degrees Celsius, the resulting water at 99 degrees 
Celsius continues ’’being water” (small details will change, for example a thermometer inside the 
water will show a small increase in its readings, but water will remain ’’being water”). However, 
when the temperature increases a further degree Celsius, there is a sudden change in the collective 
behaviour of water, becoming steam. In an analogous way, when the default correlation in the 
Diamond model is increased a bit above the ’’quasi phase transition point’s default correlation” (to 
be calculated from the model), the shape of the probability loss distribution (basically the same 
for all default correlations below that default correlation) changes to a qualitatively different one 
(which remains basically the same for all default correlations above that default correlation). 

In Section 6, we motivate the use of the Jungle model to study model risk in a credit risk 
framework, i.e. we show how and when the Jungle model can cope with the inherent uncertainty 
under systemic credit events, such as the ones presented in the previous section (doubly peaked 
distributions and quasi phase transitions). 

In Section 7, we show the Jungle model can be used to model inhomogeneous portfolios, gener¬ 
alizing straightforwardly the binomial loss indicators in Section 4. Even more, we show the Jungle 
model can straightforwardly be generalized to cope with state-dependent recovery rates, modelling 
the stylized fact that recovery rates go down when default rates increase. 

In Section 8, we argue the three known sources of default clustering (contagion, macroeconomic 
risk factors and frailty; see the next subsection, ’’Related literature”, for further details) can be 
understood under the unifying framework of contagion. 
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In Section 9, we provide a series of policy implications arising from our contagion models. In 
particular, we show both the U.S. subprime and the European peripheral crises can be understood 
as particular instances of quasi phase transitions. Also, we are able to understand qualitatively 
other empirical evidence, such as the thick tails in the historical probability distributions of credit 
losses presented in Section 2, as well as the surprising fact that quite often, the worst quality credit 
portfolios end up with default rates lower than the corresponding ones with a better rating. We also 
pictorially analyse the ’’Too Big To Fail” phenomenon under our framework based on contagion, 
and we compare systemic risks out of contagion for a ” financial economy of big corporates” versus 
an economy of ” industrial entrepreneurs”. 

The final section concludes with a summary of the results. 


1.1. Related literature 


Recent literature suggest there are three main sources of credit clustering: macroeconomic risk 
factors, contagion and frailty. 

Macroeconomic risk factors, such as S&P 500 returns or short term rates, are common to all 
credits in the portfolio. When the economy grows strongly, the conditional probabilities of default 
are low. On the contrary, when the economy weakens, the conditional probabilities of default 
increase. The passage in time of the business cycle induces in a natural way a correlation among 
credits. Many standard credit portfolio models can be understood as particular instances of a mixed 
bi nomial model, see ([Embrechts et «i.||2003 ). 

(Azizpour et al. 2014) and (Das et al. 2007) reject the hypothesis that macroeconomic risk factors 


are able to fully explain the clustering of corporate defaults by themselves, even though (Lando 


et al. 2010) argues on the contrary. 


Contagion can be understood as direct links among credits, such as the ones in a supply chain, 
or the bank-creditor relationship. A financial crisis may be a prototypical case of contagion, since 
banks tend to be highly connected with large parts of the economy, and their financial failure may 
create a deleveraging, impacting directly on the balance sheet of their borrowers. Contagion was 
analysed with a dynamical approach in ( Davis et q 7.| 2001), (Jarrow et al. 2001), (Giesecke et al. 
2004), ( Schdnbucher||2004 ), (Liitkebohmert 2009), ( Steinbacher et a/.||2013 ) and in an Ising setting 
by (Molins et al. 2005), followed by ( |Kitsukawa et al. 2006) and (Filiz et al. 2012). 

Frailty can be described as the ’’Enron effect”: once the disputable accounting practices were 
revealed to the public, the probabilities of default of many other companies, in different sectors and 
regions, readjusted according to the new information. Most likely, no direct links out of contagion 
between Enron and those companies ever existed, but default correlations arose nonetheless. 

( Azizpour et aZ.|2014 ), ( Duffle et a/.|2009 ), ( |Lando et a7|2010| ) and ( |Koopman et al. 2011) include 
frailty, contagion or both in order to try and explain the clustering of corporate defaults, on top 


of macroeconomic risk factors. (Azizpour et al. 2014) and (Koopman et al. 2011) conclude both 


frailty and contagion are necessary to fully explain the clustering of corporate defaults in their 
datasets, on top of the macroeconomic risk factors. 

This paper differentiates from the rest of the literature on contagion networks in the credit arena 
by arguing its results are independent from the specific details of the ’’microscopic” credit interac¬ 
tions. In particular, the Maximum Entropy principle argues that given a set of empirical moments 
of an, in principle, unknown probability distribution, the ’’best” probability distribution is the 
Jungle model. The relevant empirical data in the credit arena is known by market participants to 
be probabilities of default and default correlations. We show that assuming both the probabilities 
of default and the default correlations correspond to the empirical moments of the unknown prob¬ 
ability distribution of credit losses, the Jungle model arises ’’naturally” and without the need to 
impose the specific knowledge of the credit interactions among the constituents of the considered 
credit portfolio. 
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2. The data 


We use Moody’s All rated Annual Issuer-Weighted Corporate Default Rates, from 1920 to 2010, 
see (Moody’s Investors Service 2011), and (Giesecke et al. 2011) value-weighted default rates on 
bonds issued by U.S. domestic nonfinancial firms from 1866 to 2008. 

As often discussed in the literature and among practitioners, default rate data tends to have 
issues regarding its interpretation as default losses. This is even more the case for such a long 
term data set as the ones we use. Our approach is pragmatic: (Moody’s Investors Service 2011) 
and (Giesecke et al. 2011) data are no-nonsense, since even though the data definition process is 
probably not ’’rigorous” enough (and it cannot be), the data probably is robust enough (the data 
contains several full business cycles in both cases). 


One of the reasons we use a longer data set than is customary in the literature ((Azizpour et al. 


2014) uses data starting on 1970; (Das et al. 2007) on 1979; (Lando et al. 2010), on 1982; (Duffie 


et al. 2009), on 1979) is that our models do not require the use of macroeconomic or firm-specific 


data, so we can go backwards as far as we want, while there is still default rate empirical data. 
On the contrary, for example the S&P 500 was launched on 1957, so a researcher needs a lot of 
ingenuity to be able to find the corresponding macroeconomic and firm-specific data corresponding 
to several decades ago. 

From (Moody’s Investors Service 2011), using the default rate for speculative grade bonds, as 
well as the number of defaults corresponding to speculative grade bonds, we are able to compute 
approximately the total amount of speculative grade bonds for each year. For the rest of the paper, 
when we try to model speculative grade bonds, we will use the average number, close to 800. 

Unfortunately, the corresponding data for single ratings is not provided in the paper. As a 
consequence, when we deal with Caa-C ratings, we arbitrarily reduce the number for speculative 
grade bonds by one order of magnitude, 80. 


3. Credit portfolio modelling 


A credit portfolio consists of N credit instruments. A credit portfolio model is a theoretical construct 
providing as an output the unconditional probability distribution for the losses of a given credit 
portfolio (the unknown we will focus our attention on for the rest of the paper). 


Moody’s KMV (Merton 1974), CreditMetrics (Gupton et al. 1997), CreditRisk+ (CreditSuisse 


1997) and CreditPortfolioView (Wilson 1997) are commercially available credit portfolio models. 


Additionally, the Gaussian copula (Li 2000) became a widespread tool to value credit derivatives. 


None of these credit portfolio models were able to model tail risks adequately during either the 
U.S. subprime crisis or the sovereign and banking crisis in peripheral Europe. Especially, some 
observers believe the Gaussian copula was a ’’recipe for disaster”, see (Wired 2009), (FT 2009), 
when modelling credit tail risks. 

The losses of a credit portfolio can be calculated as: 


N N 

L = ^L i = ]T£)( l- RR i)k (1) 

2=1 2=1 

where Ej denotes the Exposure at Default, i.e. the maximum potential loss out of the credit 
instrument i (usually, the nominal of the bond or loan), (1 — RRi) denotes the Loss Given Default 
(RR stands for Recovery Rate) describing the fraction of the Exposure at Default that is effectively 
lost when the i-th borrower defaults, and li is an indicator taking values in {0,1}, and which 
describes if the i-th borrower is defaulted or not. 

In general, real world cases, li variables are stochastic, as well as the recovery rates, and the 
portfolio is inhomogeneous (in general, E) ^ Ej for at last some i ^ j). The modelling for the 
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related probability distribution of losses is challenging. 

We will state our credit portfolio model has been solved when we have found the probability 
distribution for the losses of that portfolio, L. Our target for the rest of the paper will be to 
motivate, calculate and analyse the probability distribution of L. 

From now on and until Section 7, we will make the simplification of analysing homogeneous 
portfolios (with Exposure at Default set at 1), and we will not model Recovery Rates (which is 
analogous to assume the Recovery Rates are constant and the same for all borrowers). In Section 
7, we will deal with the general case of inhomogeneous portfolios and state-dependent Recovery 
Rates. We will show the simplifications described above do not represent a loss of generality. 

As a consequence, the state space simplifies to a set of discrete variables taking values 0 or 1, 

N 

D = { (ii , I 2 , ■ ■ ■, In) I h £ {0,1}, i = 1, 2,... , N}. The loss simplifies to i = h- 

2—1 

The probability distribution of a random variable is, in general, unknown and unobservable per 
se. One possible way to derive it is to aggregate the dynamical, ’’microscopic” processes underlying 
the random variable. For example, modern physics has been successful at stating microscopic 
dynamical laws from first principles (quantum mechanics and quantum field theory), and finding the 
related macroscopic equations (thermodynamics) through an averaging process called Statistical 
Mechanics. 

However, in social sciences this process is fraught with difficulties. In general, the underlying 
dynamical processes are unknown. The usual methodology then is as follows: 

The probability distribution of a random variable is not an observable. But there are observables 
of the random variable which can be understood as direct calculations from using the probability 
distribution. For example, the expected value of the random variable is the first moment of the 
corresponding probability distribution. The variance and the correlation correspond to the second 
moments of the probability distribution. Skewness and kurtosis, which are widely used empirical 
observables, are the third and fourth moments of the distribution. 

A mathematically well behaved probability distribution can be fully described by its moments. 
In particular, our underlying random variable, the loss in a credit portfolio, is a bound variable, 
so we are not concerned about the possibility of moments becoming infinite when the tail of an 
unbound probability distribution is ”fat enough”, see (Bouchaud et al. 2003). As a consequence, it 
makes sense to assume that despite the probability distribution being not observable, an analyst 
may finally recover it through the empirical knowledge of its moments (or in general, through 
the knowledge of the expected value of a general function; the moments are expected values of 
polynomials). 

The question is then: given the knowledge of all or some of its moments, is there a way to find 
the general form of the probability distribution of the underlying random variables? 

The Maximum Entropy principle, or Maxent, provides a specific answer to this question. Maxent 
asserts: 

Given a finite state space D, the probability distribution in H that maximizes the entropy and 
satisfies the following m < card(D) constraints, given m different functions in H, fk(x), and m 
fixed numbers F^: 


(fk(x )) := ^2 P ( x )fk( x ) = F k, k = 1, 2, • • • , m (2) 


as well as a normalization condition: 


(1) :=^P(x) = l 


(3) 
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is: 


P(x) = 


1 


A 2 , ■ ■ • , A m) 
where Z is called the partition function: 


ex P ( - 

\ i =1 


(4) 


Z{ Ai, A 2 , • • - , A m ) = ^exp ~y^ y Xifj{x) 


(5) 


O \ i =1 / 

The Lagrange multipliers Aj are found by inverting the following set of m equations: 


F k = ifk{x)} = - 


d logZ(Ai, A 2 , • • • , A m ) 
d\ k 


k = 1 , 2 ,- 


, m 


( 6 ) 


The intuition behind Maxent is P(x) is the ’’best” [^probability distribution an analyst can come 
up with, assuming all the empirical evidence about the problem at hand is summarized as expected 
values of functions (the F\ numbers and the fk(x) functions, respectively). The expected values 
are taken over the (unknown) probability distribution P(x). The claim above is further discussed 
at Appendix A. 

It often happens that while the ’’real” probability distribution of a given system is unknown, 
some constraints are naturally known. For example, in the trivial case of throwing a dice, we know 
that whatever the correct probability distribution is, the probabilities for each state (each of the 
six faces of the dice) must add up to one. In fact, Maxent for the dice gives a uniform probability 
distribution, with a p = \ for each of the faces of the dice. 

In the same way, if we know, in addition to the fact that all probabilities must add up to one, 
the expected value of the random variable, Maxent produces the binomial distribution. We will 
see below that when, in addition to the fact that all probabilities must add up to one, both the 
expected value of the random variable and its correlations are known, Maxent gives the Jungle 
model. 


Maxent is a general principle which pervades science, see (Jaynes 2004). As a consequence, we 


feel comfortable enough by stating that Maxent is a reasonable principle to pick the probability 
distribution of losses for a credit portfolio, consistent with the available empirical data. 

Let us apply Maxent to a given credit portfolio: 

Our state space is fl = {(h,h, ■ ■ ■ ,In) \ U £ {0,1}, * = 1,2,..., N}, a set of discrete variables, 
li, taking 0 or 1 values, and representing the default / non-default state of the i-tli credit. As a 
consequence, the potential moments we might derive from the (unknown) probability distribution 
of losses are: 


• The first order moment, (li). This is the so called probability of default of the i-th borrower, 
Pi 

• The second order moment, (klj), for i / j. This is directly related to the so called default 
correlation between the i-th and j-th borrower, 


Pij 


_ Qij - PiPj _ 

\/pi(l- Pi)yjpj(l -Pj)' 


Qij •— 


(7) 


x We leave the ’’best” concept undefined 
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• The second order moment, (1,1 j) = (if), for i = j. However, since l, only takes values in {0,1}, 
it is true that if = li , so the knowledge of this second moment becomes irrelevant 

• In general, any power of li, if, with k being a natural number, becomes l, 

• The third order moment, (hljlk), for i / j / k, would correspond to the effect on the 
creditworthiness of the z-th borrower, assuming both the j-th and k- th borrower also default. 
This effect is conceivable in theory. However, and as far as we know, there is no serious 
discussion of this phenomenon in the credit literature. 

• Any moment of order higher than three is bound to the same discussion as the one for the 
third order moment above 

There is a general consensus among practitioners that the corresponding available empirical 
information for a credit portfolio can be summarized as: 

• The probability of default of a borrower can generally be estimated, either from CDS for liquid 
names, or from Internal Ratings models for illiquid bonds or loans. Estimates tend not to be 
too noisy 

• The default correlation between two borrowers is harder to estimate that the corresponding 
probabilities of default. There are no financial instruments similar to the CDS to imply the 
default correlation, or if there are, they tend to be illiquid and over-the-counter (opaque infor¬ 
mation), and providing noisy estimates. Having said that, and despite the practical difficulties 
for its estimation, the consensus is the default correlation exists, it can at least be measured 
in some cases, and it is a key variable to understand default clustering 

• Third, and higher, order moments bear no specific names in the credit arena 

As a consequence, we claim that (at least in our mental framework, which consists of disre¬ 
garding dynamical, ”from-first-principle” equations, and only considering probability distributions 
arising from imposing empirical constraints to Maxent) the empirical available information for 
credit portfolios can be summarized in the probabilities of default and default correlations of its 
constituents. 

Maxent selects the Jungle model as its preferred probability distribution for credit losses, con¬ 
sistent with the available empirical data, as seen in the next section. 


4. The Jungle model and credit risk 


We consider a credit portfolio of N credit instruments, with a space state H = {(l\,l 2 , ■ ■ ■ ,In) \ 
li 6 {0,1},* = 1,2,...,7V}. 

We consider the set ’’labeling” the N nodes, 0 = {1,2,..., AJ} and the set ’’labelling” the 
N(N - 1) 

--- pairs of nodes, <P = {(*, j) \ i = 1, 2,..., N & j > i\ (the pair ij and the pair ji are 

considered to be the same), and two subsets of those, 0 6 0 and (j) 6 
In consistency with the previous section, we assume the full available empirical information of 
the corresponding credit portfolio can be summarized as the probabilities of default and the default 
correlations of its constituents. 

We will always consider 0 = 0, or in other words, we assume it is possible to give estimates of 
the probabilities of default for all the constituents in the portfolio, but (j) will usually be a proper 
subset of <P, meaning some of, but not all, the default probabilities can be estimated. The general 

N(N — 1) 

case will be one in which 1 <C card(</>) <C ---. 

Using the framework of Maxent, we claim that given the following empirical data, consisting of 
default probabilities and default correlations: 


Pi,\/i 6 9, with pi 6 [0,1] 

Pij,V(i,j) 6 i j>, with pij 6 [—1,1]; we define q,j such that the relationship 



December 2, 2015 


MV19'cont'20150923 


_ Qij ~ PiPj _ 

y/Pii 1 -P^VPji 1 ~Pj) 


Pij,V(hj) e <P holds 


Leading to the following empirical constraints: 


• Pi = (k), Vi £ 0 

• Qij = (klj),V{i,j) € <j) 

Maxent picks the Jungle model among all the probability distributions consistent with those 
constraints El 


where 


P(h,h, ■ ■ ■ Jn) = 7F ex P 


I ^ ^ Qt-ili T ^ ^ Pijlilj 

\i£9 (i,i)60 


( 8 ) 


z = J2^[ X] aili + Pijkh ) ( 9 ) 

n \i£6 (ui)6</> / 

The unknown parameters a, and have to be found by forcing the probability distribution 
gives the right estimates for the empirical information at our disposal, i.e. the following constraints 
are satisfied: 


Pi = (k) 


d log Z 
ddi 


( 10 ) 


Qij — {lilj) 


cllogZ 

dflij 


( 11 ) 


5. The Jungle model, hands on 

After showing the Maxent principle picks the Jungle model as the probability distribution of 
choice to analyse credit risk (assuming the full empirical information of the credit portfolio can be 
summarized as the probabilities of default and the default correlations of its constituents), we try 
and motivate the Jungle model, by studying some particular instances of the general model. 

To accomplish that goal, this section presents a few particular cases of the general Jungle model: 
the binomial model, adding small contagion to the binomial model, the Dandelion model and the 
Diamond model. 

As discussed in Section 3, we will state a probabilistic credit model has been solved once its 
probability distribution has been computed, either analytically or numerically. There are two ways 
to solve a model: 

• Z. the partition function, has been summed analytically. We will show the explicit calculation 
of Z for the Dandelion model below. We will use this methodology for the rest of Section 5. 


1 In the physics literature, the Jungle model is called the Ising model with external field, with both space dependent external 

fields and space dependent local interactions 
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• If the partition function cannot be summed analytically, Markov Chain Monte Carlo meth¬ 
ods may allow to generate realisations of the underlying probability distribution, P, without 
having to know its explicit form. With those realisations, all kind of averages of the distri¬ 
bution can be computed. We will apply this methodology in Section 7, when dealing with 
inhomogeneous portfolios and state-dependent recovery rates. 

In this section, we will show the Jungle model is able to introduce credit contagion in a way the 
standard credit portfolio models cannot. In particular, the Jungle model will be shown to model 
credit correlations under ’’normal economic conditions” (in a similar way to a Gaussian copula, 
at least at a non-quantitative level of discourse), but also to endogenously generate quasi-phase 
transitions, which can be understood as modelling systemic credit crises, arising ’’out of nowhere”, 
a phenomenon that by definition, a Gaussian copula cannot cope with (in other words, a model 
under a Gaussian copula never suffers from systemic crises). 


5.1. The binomial model 

For a credit portfolio whose probabilities of default are known and equal to each other, {pi := p \ 
i = 1,2,..., N} but whose default correlations {pij \ i = 1, 2,..., N &; j ^ i} are unknown, the 
probability distribution chosen by the Maximum Entropy principle is: 


P(M2,-‘-,M = ^exp (12) 

Due to homogeneity, the distribution above becomes the binomial distribution: 

p (x>=4 = (T)^ {i ~ p)N ~ e (i3) 

with the identification p = 1+ *_ Q . In other words, for the uncorrelated portfolio, the parameter 
a can be interpreted as (a simple function of) the probability of default. The proof of this result is 
given in Appendix B. 

Since the binomial distribution corresponds to independent defaults, it makes intuitive sense 
the Maximum Entropy principle selects it when there is no information whatsoever on empirical 
correlations. 


5.2. Small contagion 

In the previous subsection, we have seen the Jungle model with {«* := a \ i = 1,2,... ,N} and 
{/ 3ij = 0, V(i, j) £ (j)}, becomes the binomial distribution. Then, the probability of default of the 
credit instruments becomes (a simple function of) a. 

We might ask ourselves which would be the effect on the portfolio of adding the minimum amount 
possible of f3ij (a small /3 to only a pair of nodes, say 12, and Pij = 0 for any other pair of nodes ij 
different from 12) to the Jungle model corresponding to the binomial distribution (only with a), 
or in other words, we are interested in expanding perturbatively around the binomial model, in 
order to single out the effect of Pij, i.e. to see if Pij can be interpreted in relation to the empirical 
parameters, p and p, in the same way for the binomial distribution we could interpret a as (a 
simple function of) the underlying p. 

The corresponding probability distribution for the losses of that portfolio is: 
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Pp{h,h,--- , In) = ^ ex P + Phhj (14) 

The answer to our question is p^ 2 is proportional to j3, for small (3 and for a given probability 
of default, as it can be seen from Appendix C. In other words, when small amounts of contagion 
are added to an uncorrelated credit portfolio, the default correlation increases (from 0). And the 
rate of increase is proportional to /3, so for small contagion, the coefficient f3 can be interpreted 
as (a simple function of) the default correlation, in the same way that for no contagion, a can be 
interpreted as (a simple function of) the probability of default. 

Also, we can see that //j* = p 2 , by symmetry. But pf > p^ _0 , with the increase being proportional 
to (3. Instead, pj = pj~°,j = 3, • • • , N, since the nodes j = 3, • • • , N are not affected by contagion. 

In other words, when some contagion is added, it is not true any more that a is (a simple function 
of) p, since there is a ’’mixing” between a and (3 and their relationships with respect to p and p. 

We want to emphasize that the model above does not correspond to a credit portfolio whose 
probabilities of default are known and equal to each other, {p* =: p | i = 1,2,..., N} and the 
default correlation is only known for the pair of nodes 12, (3\2 =: (3 and 3{j = 0, for ij ^ 12. 

As we have seen above, the probabilities of default for the model described by: 


Pp(h, h,-" ’ 1 n) = exp ^ k + Phhj (15) 

are not the same for all nodes: credit instruments with a contagion link, such as h, experience 
an increase in their probabilities of default, with respect to those nodes without a contagion link, 
such as I 3 . 

The probability distribution satisfying the empirical conditions such that the probabilities of 
default are known and equal to each other, {p* =: p | i = 1, 2,..., N} and the default correlation 
is only known for the pair of nodes 12, ( 3\2 =: /3 and = 0, for ij / 12 is: 


Pp(h,l2, ■■■ ’ 1 n) = ^ exp ya 0 (h + l 2 ) + a^k + Phhj (16) 

Where «o is such that the constraint (h)p = (h)p = p is satisfied, being different from the a 
required to satisfy the constraint (h )/3 = ■■■ (In) p = P- For this case, it is also true that for small 
/ 3 , the default correlation of the pair 12 is proportional to j3. 

In the general Jungle case in which the model contains both cq and /3ij: 


P(h,l 2 , ■ ■ ■ Jn) = 7F ex P 


I ''y ( o!iii t y ( fiijiiij 
\i£0 b,i)e</> 


(17) 


it will not be true any more that a is (a simple function of) the probability of default, and {3 
is (a simple function of) the default correlation: for the general Jungle case, there is a ’’mixing” 
between a and (3 and their relationships with respect to p and p. 
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5.3. The Dandelion model 

The Dandelion model corresponds to a Jungle model with N + 1 borrowers, such that the first one, 
defined as i = 0 and considered to be at the centre of the Dandelion, is ’’connected” to all remaining 
borrowers, at the external surface of the Dandelion, such that /Jo* =: /3 / 

Any other borrowers remain unconnected, fiij = 0 for i = 1,2, ■ ,N & 
we assume Oii =: a for i = 1,2, • • • , N. 

The probability distribution for the Dandelion model is: 

1 / n N \ 

P(h,l2, ...,In) = v^exp ( a o l ° + «y ^lj + P^loli J 

V i =i i =i / 

The Dandelion model, despite being interacting, can be fully solved, with 
bution for its losses given by: 


P(£ = ^ 4) = 4 ( ex P M) + ex P (“o + + £))) ( 19 ) 

i=i x ' 


0 for i = 1, 2, • • ■ , N. 
j > i. For simplicity, 

(18) 

the probability distri- 


where Z is given by: 


Z = (1 + e a ) N + e a °(l + e a+f} ) N 

and a, ao and j5 are given explicitly as functions of the empirical data, p, po and p: 


( 20 ) 


oto = (N — 1) log ( 1 P ° ) + A^log 


P o 

a = log 

P = log 


po-q 


i-po-p + q 
p-q 

i-po-p + q 

q l-po-p + q 


K po - q p-q 

where q can be derived from the definition of default correlation: 


( 21 ) 

( 22 ) 

(23) 


P = 


q-ppo 


Vp( 1 ~p)Vpo( 1 -Po) 


(24) 


The proof can be found in Appendix D. 

To provide intuition for the Dandelion model, we have calculated its probability distribution for 
a set of reasonable parameters, N = 800 and p = po = 2.8%, which correspond to the historical 


default rate average for global speculative-grade bonds, as per (Moody’s Investors Service 2011), 


and for a given range of possible default correlations. The result can be found in Figure 4: 

The probability distributions in the chart show a ’’double peak” pattern: on one hand, a first 
peak, centred at low losses and not unlike the corresponding peak for a binomial distribution. On 
the other hand, a smaller but not insignificant second peak, corresponding to a high level of losses, 
and consistent with avalanches / domino effects due to contagion. 
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N=800 p=0.028 p 0 =0.028 



Figure 4. Probability distributions for the losses of the Dandelion model, corresponding to different default correla¬ 
tions 


The higher the default correlation, the higher the extreme losses (the second peak moves further 
to the right on the chart). Also, the higher the default correlation, the lower losses on the first 
peak. Contagion works both ways: defaults lead to more defaults (with respect to the binomial 
case), non-defaults lead to more non-defaults (with respect to the binomial case). These two effects 
can be seen more specifically from the two insets in the chart. 

Also, the higher the correlation of default, the higher the Value at Risk and the Expected 
Shortfall. The dependency of these two risk measures with respect to the corresponding default 
correlation is exemplified by the following table (at the 99% confidence level). 


p 

VaR 

ES 

0.00 

0.041 

0.044 

0.01 

0.043 

0.046 

0.02 

0.049 

0.055 

0.04 

0.069 

0.076 

0.08 

0.109 

0.117 

0.16 

0.188 

0.198 

0.32 

0.344 

0.356 


The Dandelion model can be understood as a bridge between macroeconomic risk factors and 
contagion. Specifically, in the derivation of the Dandelion model in Appendix D, the following 
equation arises: 


V = (1 - Po)p(ct) + Pop{a + /3) 


(25) 


where 


p{a) = 


1 + e~ a 


(26) 
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corresponds to the relationship between p and a described for the binomial (non-interacting) 
case. 

As a consequence, the central node in the Dandelion could be interpreted as endogenously gener¬ 
ating a ” macroeconomic state of the economy”, whereby for a fraction of time given by 1 — po the 
economy remains in a ” good” state of the economy, with a probability of default for its constituents 
given by p(a) = 1 + g_ Q , and for a fraction of time given by po the economy remains in a ’’bad” state 
of the economy, with a probability of default for its constituents given by p(a + j3) = ^ V-.u , 
where p(a + (3) > p(a), and the difference is accounted by the ’’contagion factor” (3. 

In other words, the Dandelion model endogenously generates a kind of mixture of binomials, able 
to generate a doubly peaked distribution and clustering of defaults. 


5.4. The Diamond model 

The Diamond model is defined by: 


/ N 

exp a ^ l j + (3 

I1J2 } ...,Zjv y i=\ 

The Diamond model describes a set of credits, all interacting among each other. For example, if 
JV = 4, node 1 could be a bank, node 2 a cement producer, node 3 a real estate developer and node 
4, a car dealer. The cement producer, the real estate developer and the car dealer get financing 
from the bank, so there are correlations of defaults between the pairs 12, 13 and 14. Also, the 
cement producer is a supplier to the real estate developer, so the pair 23 is also correlated. Finally, 
workers at firms 2 and 3 purchase cars from the car dealer, so a default of 2 or 3 would impact on 
4 business, creating also default correlations between 24 and 34. 

The partition function for the Diamond model is given by: 



E 

i>j 


lilj 


(27) 


N 


z = J2 ( N ») ex p ((«- ?x+1" 2 

e=o 


2 ' 2 

And the corresponding probability distribution for the losses will be: 


(28) 


N 


Pi! = 5 >) 


i= 1 


exp ((a — f )l + f 


(29) 


We can relate the empirical data, p and p to the model parameters a and /3, from the following 
two equations which can be inverted numerically: 


»> = zv E X V exi> - X+f 


q = 


ZN(N — 1) 


N 

E 

e=o 


e=o 

N\ 1 „ 


x+f/) 

( 30 ) 

x+f* 2 ) 

( 31 ) 


Appendix E gives a proof of the previous statements. 
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The Diamond model clearly exemplifies one of the most interesting phenomena of the Jungle 
model: quasi phase transitions. 

Let us see how the probability distribution of losses for the Diamond model changes, when we 
smoothly change default correlations, for the probability of default fixed at a given level (with 
parameters N = 20 and p = 40%, for easiness of visual inspection; below, we will provide another 
example, with N = 50 and p = 2.8%): 





Loss (%) Loss (%) 

Figure 5. Loss probability distributions for default correlations below, around and above the quasi phase transition 
point 

We can see there is a sudden change of collective behaviour for the probability distribution of 
losses when we smoothly change p from 10% to 20% to 30%, at some point between these default 
correlations: 
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For default correlations at around 10% or below, the Diamond model presents a standard be¬ 
haviour with losses spread with a given width around the expected value, 40%. However, when the 
default correlation increases only slightly (to 25%, say), a different behaviour for the probability 
distribution of losses starts to emerge: the probability distribution for the losses becomes bimodal, 
as it can be seen from Figure 5. And the more the default correlation increases, the larger the 
potential losses out of the second peak on the right. 

Another numerical example, this time with N = 50 and p = 2.8%, the average default rate for 
speculative-grade bonds in the ( Moody’s Investors Service||2011 ) sample, shows how a quasi phase 
transition changes dramatically the risk profile of the loss probability distribution, given small 
changes of the empirical values determining the portfolio (probabilities of default and, especially, 
default correlations): 
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Figure 6. Cumulative probability loss distributions for default correlations below, around and above the quasi phase 
transition point 

From Figure 6, we can see a sudden jump for the Value at Risk at the 99.9% confidence level, 
given a small increase in the default correlation 0 

The above phenomenon can also be analysed by looking at how the empirical variables, both the 
probability of default and the default correlation, change when the model parameters, a and (3, 
change, as shown in Figure 7. 

Despite the fact that for a finite N, there can be no (exactly discontinuous) phase transition, 
from the figures above we can see a sharp, almost discontinuous behaviour throughout a clearly 
visible diagonal line in the space of model parameters. By analogy, we give the name ’’quasi phase 


lr This is the explanation for the use of the ”phase transition” concept, borrowed from Statistical Mechanics. Phase transitions 
suffer a sudden jump in a given variable, induced by a small change in another, underlying variable. However, a quasi phase 
transition is not a phase transition, as properly defined in Statistical Mechanics. For example, phase transitions for the Ising 
model, the equivalent of the Jungle model in Physics, cannot happen for finite AT, and throughout the paper we assume N is 
always finite. 
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transition” to that phenomenon. The existence of such a line, ending in the so called ’’critical 
point”[^J is well known in the Condensed Matter physics literature. 

Since there is a known relationship between the empirical parameters, the probability of default 
and the default correlation, and the model parameters, a and /3, an analyst can use either the 
empirical parameters or the model parameters to describe the behaviour of the model. The model 
parameters tend to be more useful to analyse the behaviour of such systems, because as observed 
from Figure 7, a small change in the empirical parameters always results in a small change for the 
model parameters, but the opposite is clearly not true (at the line of quasi phase transitions): 



Figure 7. Probability of default and default correlation for a given set of model (normalized to values between 0 and 
1) parameters, a and /3, and N = 80. 

Also, any credit portfolio is driven by underlying, fundamental economic factors (both macroe¬ 
conomic and microeconomic). As a consequence, we can understand the evolution in time of a 
credit portfolio described by such a model as the smooth change of the model parameters when the 
underlying, fundamental economic factors change. Usually, the empirical parameters will be such 
that the system will move around the bottom left corner of the figure 7 (relatively low probabilities 
of default and default correlations). 

However, when the economic conditions are such that the system is close to (but still below) 
the line of quasi phase transitions, a small change in the underlying, fundamental economic factors 
may lead to a small change in the model parameters, such that the system may inadvertently cross 
the line of quasi phase transitions, resulting in an abrupt, almost discontinuous, change in the 
empirical parameters. 

As a consequence, the Diamond model shows anti-intuitively that the collective behaviour of the 
portfolio may significantly change due to small changes of the empirical values determining it. 

This phenomena is not unlike the phase transition of water into steam: if we increase one degree 
Celsius the temperature of water at 98 degrees Celsius, the resulting water at 99 degrees Celsius 
continues ’’being water” (small details will change, for example a thermometer inside the water 
will show a small increase in its readings, but water will remain ’’being water”). However, when the 
temperature increases a further degree Celsius, there is a sudden change in the collective behaviour 
of water, becoming a coexistence of steam bubbles and liquid. 

So, a small change of the underlying parameters leads to a significant change of the behaviour 
for the whole system. This is surprising, since if we could solve all the dynamical equations of 


2 It is known in the Physics literature the critical point lies at a = —2, /3 = A in the large N limit, corresponding approximately 
to a probability of default of 44% and a default correlation of 11% for N = 80. 
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motion for the say 10 23 particles in a litre of water, it seems unlikely that with that knowledge we 
could have forecasted such a dramatic change of behaviour. It is the averaging out of ’’irrelevant” 
degrees of freedom, undertaken by statistical mechanics, which allows to keep only the (small set 
of) parameters which really matter at the level of one litre of water. 

Analogously, the Diamond model shows a quasi phase transition from a phase dominated by a 
’’binomial-like” behaviour, whereby losses spread over a given width, centred around the expected 
loss, towards a coexistence region, dominated by avalanches due to credit contagion, and determined 
by a doubly peaked distribution. The transition from one phase to the coexistence region is caused 
by a smooth change of the empirical parameters defining the portfolio (probabilities of default and 
default correlations). However, the variation in the global shape of the probability distribution 
changes significantly the risk profile of the portfolio, potentially inducing systemic risks. 


5.5. The Jungle model and the real world 

In a general, real world credit portfolio, a Jungle model will be defined by its topology, 0 and q\ 
as well as by the given empirical data, consisting of pi and pij over 0 and 0. 

The data will always be such that 0 = 0, or in other words, we consider it is possible to give 
estimates of the probabilities of default to all the constituents in the portfolio, but 4> will usually 
be a proper subset of <P, meaning some of, but not all, the default probabilities can be estimated. 

N(N - 1) 

The general case will be one in which 1 <C card(0) <C ---. 

Pictorially, the network corresponding to that credit portfolio will be a possibly random com¬ 
bination of links connecting many nodes in the network. But quite often, the analyst will be able 
to recognize Dandelion shapes (possibly centred at banks or other large corporates) and Diamond 
shapes, among others. As a consequence, the ability to solve exactly these three interacting models 
may prove helpful. 


6. The Jungle model and model risk 

Above, we have shown the Jungle model becomes the binomial model when no information whatso¬ 
ever about correlations is known. This result is intuitive, since the binomial model describes losses 
for independent defaults. 

In general, when there is correlation among some of the underlying credit instruments, the Jungle 
model will naturally depart from the binomial model. 

Model risk in the framework of credit portfolio modelling is the risk that a given probability 
distribution for the losses underestimates the tail risks (with respect to empirical evidence). 

Pictorially, we could think of a ’’functorial” assigning a probability distribution of losses for 
each theory. For example, the functorial would assign the binomial probability distribution to the 
’’binomial theory”, and it would assign the Jungle probability distribution to the ’’Jungle theory”. 

We could try to parametrize the space of potential theories. For example, we could assign param¬ 
eter 1 to the binomial theory (given that the binomial theory depends on a parameter p, directly 
related to its first moment), we could assign parameter 2 to the Jungle theory (given that the 
Jungle model depends on parameters p and p, directly related to its first and second moments) 
and so on. 

It seems clear that restricting to parameter 1 (most standard credit models can be understood as 
straightforward generalizations of the binomial theory) creates a significant model risk. Empirically, 
this has been seen during the recent financial crises in the Western world. 

An analyst could believe modelling credit risk with theories with parameter 2, i.e. the Jungle 
model, would be clearly better than restricting to parameter 1 (since the binomial theory is a 
particular case of the Jungle model). However, it seems that theories with parameter 2 are only 
slightly ’’larger” than theories with parameter 1, but the whole space of theories is vastly larger than 
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2. We could see ourselves then climbing the ladder, enlarging progressively the space of theories, 
but always aware that we would suffering from a massive model risk, since the ’’real theory” could 
be one with n > 2 parameter, whatever n could be. 

Empirically, probabilities of default can be obtained easily for a wide range of companies. Default 
correlation data is not as easily available as probabilities of default are, but it is a relevant parameter 
for practitioners and academics alike. However, expected values of cross-products of three or more 
li, such as (liljlk), f° r if- j k, are not known. As far as we know, these terms have not been 
seriously considered in the related literature. 

As a consequence, probably the Jungle model is not the most general credit risk model. The 
’’correct” credit risk model could be one with n > 2 parameter Q and this is unknown to us. 
Having said that, Maxent picks the Jungle model as its credit risk model of choice, in consistency 
with the available empirical data (probabilities of default and default correlations). 

In other words, the Jungle model is the best we can do with the empirical credit data at our 
disposal. 

Another issue related to model risk is what do we mean by ” available empirical data”: any sample 
data is bound to intrinsic uncertainty. Not only because of fluctuations over time, but also due 
to imperfections in how data is presented and collected (there are hundreds of different day count 
conventions in finance, for example). 

And small fluctuations in the empirical data may have a big impact on the selected model. In 
the presentation so far, we have not discussed how do we select in practice the parameters in the 
Jungle model, a* and Pij, to fit with the empirical data pi and pij , apart from stating (as per 
Maxent) that the constraints: 


Pi = (k) = 


d log Z 
dai 


(32) 


Qij — (J'iJi'j ) — 


<9 log Z 

dfiij 


(33) 


have to be satisfied. In the previous section, we were able to invert analytically the relationship 
of pi and with respect to ct; and pij for the Dandelion model. For the Diamond model, we 
showed the corresponding equations can be solved numerically. 

However, for a general Jungle model, the situation is probably much more precarious. We might 
have a large amount of bonds and loans, N, also N probabilities of default, and a large number 
(much larger than 1, but much smaller than the maximum possible amount of links, Q f 

default correlations among the borrowers. And it could perfectly be possible we could not sum the 
partition function Z analytically, so we would need to resort to MCMC methods. 

In this situation, a ’’Jungle inverter” (i.e., a function providing on and Pij on 0 and p, given a set 
of empirical values for pi and p^ on 9 and p, see for example (Rou di et a/.| |2009)) would be noisy. 
A good ’’Jungle inverter” could give the ’’correct” a* and Pij if supplied with the ’’correct” pi and 
Pij. But as discussed above, it will never be possible to provide the ’’correct” empirical data, we 
will always be using sample data, prone to an unavoidable margin of error. 

We suggest the following way of thinking about model risk: 

The empirical data pi and p t] has not to be thought as a point in a (N + N ^ N 2 — )-dimensional 
space, but as a (IV + N ^ N 2 ^ )-dimensional cube centred at p % and p^, with a certain width, 5pi and 
S Pij. 


x If new empirical data were known in the form of higher order moments, the framework in this paper could cope with that. In 
fact, in Statistical Physics this kind of extended Jungle models, including trios and higher order interactions, have been studied 
extensively 
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We should then sample randomly a point from that cube, Pi and pij. The Jungle inverter would 
give us a set of of and fy. Sampling again from the cube, we would get another set of parameters, 
and so on. 

Due to the large scale of the inversion, it seems reasonable to assume that even for small 5pi 
and 5pij, different samples will yield significantly different topologies and a t and pij parameters, 
resulting in potentially large Jcq and 8 Pij. 

Our position on that issue is as follows: since we have by hypothesis gathered all the possible 
empirical information on our portfolio (summarized in probabilities of default and default correla¬ 
tions), and since we have argued Maxent picks the Jungle model as the credit risk model of choice in 
consistency with that data, and since the uncertainty on our empirical information is unavoidable, 
all those different models are to be considered. 

As a consequence, model risk analysis would be adamant to analyse not only ’’the” model consis¬ 
tent with our empirical data, but all the models consistent with our empirical data (probably, there 
are many of them). This is not only a theoretical argument. For example, the Diamond model case 
shows that for a not too unreasonable set of probability of default and default correlation data, the 
probability distribution of the losses suffers a dramatic transformation (a quasi phase transition) 
when changing smoothly the empirical variables, especially the default correlation. 

If one of the theories consistent with our empirical data were a theory having a quasi phase 
transition point in the vicinity of the parameters cc* and Pij consistent with our empirical data p % 
and p ^, by disregarding that model we would be inadvertently creating a significant model risk. 

This way of thinking is consistent with ’’what if’ scenario analysis: it does not matter so much 
precision (i.e., being able to derive the ’’correct” cq and Pij consistent with our empirical data p % 
and p^), but robustness, i.e., we know that our data gathering process is imperfect and we know 
our Jungle inverter may not be able to find always the ’’right” a t and Pij consistent with our 
empirical data p t and p ^; for this reason, we consider a set of potential future variations of the 
empirical parameters (possibly hard coded through ’’expert opinion”) and we analyse what would 
happen in case those scenarios are realized. 

Finally, we would like to analyse when our procedure to select the Jungle model as a relevant 
credit risk model, the Maxent principle under a set of empirical constraints, could fail to be a valid 
one. In other words, we would like to make ’’model risk on model risk”. 

The underlying hypothesis in this paper (apart from the conditions on empirical data outlined 
above) is we can model credit portfolios without the need to resort to the underlying, ’’microscopic” 
dynamical processes. 

In particular, this hypothesis is implicit when we apply Maxent to a given credit portfolio, 
provided the empirical data, consisting of default probabilities and default correlations, can be 
obtained as follows: 


Pi, Vz G 9, with pi G [0,1] 
Pij,V(i,j) G (j>, with pij 


G [—1,1]; we define qij such that the relationship 


Qij - PiPj 


= Pij,V(i,j) G 4> holds 


yJpi(l~Pi)^/pj{l ~Pj) 

Maxent leads to the following empirical constraints: 


• Pi = (k), V* e 9 

• Qij = (klj),V(i,j) € p 

The underlying hypothesis is that during the time frame in which pi and pij are fixed numbers 
(i.e., a time period below the typical time frames of change in those empirical variables), the 
’’microscopic” variables l t fluctuate fast enough in order to be able to sample the whole space of 
states, and generate a meaningful value for both {Ip and (klj). 

If that is not the case, i.e. if the ’’microscopic” variables lj fluctuate slowly, or conversely, the 
empirical variables p t and p t3 fluctuate fast (in comparison to each other), the Maxent results do 
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not need to hold in practice. 

It seems reasonable to assume that under a ’’good” state of the economy, the p% and pij empirical 
values will fluctuate smoothly. Also, a credit portfolio with a low degree of default correlations 
(such that a binomial provides a good approximation for it) will probably ’’relax” fast towards its 
equilibrium configuration. As a consequence, a not-too-correlated portfolio, under ’’good” economic 
conditions, will probably satisfy the implicit conditions in Maxent, so the Jungle model framework 
will hold. 

However, under a ’’bad” state of the economy, the pi and p^j empirical values will probably suffer 
strong and sudden fluctuations. Also, a credit portfolio with a high level of default correlations 
might ’’relax” slowly towards its equilibrium configuration. For example, it could be the dynamical 
processes (unknown to us) generate a state space with many local minima, and the system might 
be trapped in a local minimum which is not the global one, and the more the time passes, the 
more likely it becomes that eventually the system jumps out of that local minimum towards the 
another local minimum, searching the global one. In that case, the averages we would measure, 
(li) and (hlj), would not be measurements on the whole probability distribution, but only on a 
small part of the state space, making the overall effort worthless (or even outright dangerous, for 
macroprudential purposes). 

As a consequence, a highly correlated portfolio, under ’’bad” economic conditions, may not be 
correctly described by Maxent, so the Jungle model framework will not necessarily hold true. 

To sum up: 

• The Jungle model may not be the ’’best” possible credit portfolio model (whatever ’’best” 
means), but at least, the Jungle model is selected by Maxent to be the credit portfolio model 
of choice, in consistency with the available empirical data. 

• Model risk under the Jungle model should be thought of as an ensemble of Jungle models, 
defined by o: t ± 5a.i and /3ij ± <5/3^, which are consistent with the empirical data pi ± 5pi and 
Pij ± 5p^. In particular, 5ai and 5(3ij could be large, even for small 5pi and 5pij. An analyst 
should study the possibility of the existence of double peaks and quasi phase transitions for 
the corresponding models, since a small change in the underlying empirical data, i.e. small 
dpi and 5pij, may lead to sudden and dramatic systemic events^] 


7. Modelling inhomogeneous portfolios and recovery rates 

In Section 3, we have discussed the modelling of credit losses for a general credit portfolio, including 
not only stochastic bi-valued indicators, but also possibly state-dependent recovery rates, and 
inhomogeneous exposure at default values. 

In Section 4, we have simplified the above general case by only considering stochastic li bi-valued 
indicators. 

We now show the general case of Section 3 can be handled with the Jungle probability distri¬ 
butions of Section 4. In other words, the problem of obtaining the general probability distribution 
for the losses of an inhomogeneous portfolio with state-dependent recovery rates can be decoupled 
into two smaller problems: 

• First, find the Jungle probability distribution for the losses of the corresponding homogeneous 
portfolio with no recovery rate modelling (using the empirical probabilities of default and 
default correlations of the borrowers in the portfolio) 

• Once the Jungle probability distribution is found, model the general case of the inhomogeneous 
portfolio with state-dependent recovery rates, by sampling the Jungle probability distribution 


1 In the same way we could not know water at 99 degrees Celsius was going to become steam when increasing temperature by 
one more degree. 
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with Markov Chain Monte Carlo methodology, and calculate trivially the corresponding losses 
in the general case for each realization of the Jungle probability distribution 

Let us show the procedure outlined above for both inhomogeneous portfolios with no recovery rate 
modelling, and for homogeneous portfolios with state-dependent recovery rates, before handling 
the general case. 


7.1. Modelling inhomogeneous portfolios, no modelling for recovery rates 

First, let us consider the intermediate case of a credit portfolio being modelled by stochastic l, 
bi-valued indicators and inhomogeneous exposure at default values, but no recovery rates. In that 
case, the total loss of the portfolio can be described as: 


N 


N 


l — y ' Li ~ y ' Eiii 


(34) 


Z=1 


i=l 


The probability distribution for L can be computed numerically, using the equation above, by 
using the probability distribution arising from the corresponding homogeneous Jungle model. 


7.2. Homogeneous portfolios with state-dependent recovery rates 

The Jungle model can also handle the case of state-dependent recovery rates for homogeneous 
portfolios, when the recovery rates follow the stylized fact of being lower when the overall default 
rate increases (and vice versa), see (Mora 2012). 

For simplicity purposes (and without loss of generality, as shown in the next subsection), let us 

1+ 1 

assume a linear dependence of the recovery rate with the overall default rate, 1 — RR = where 

N 

Et 

i is and p being the expected value of i. 

The expected value of 1 — RR is one. However, since the recovery rate decreases when the default 
rate increases, the total loss L will show the non-trivial correlations between the recovery rate 
and the default rate through an increase in the high loss region (with respect to the case of no 
state-dependent recovery rate). 


(Mora 2012 


7.3. Inhomogeneous portfolios with state-dependent recovery rates 

In general, a real world portfolio will be inhomogeneous, and the recovery rates of its constituents 
will be state-dependent, possibly in a specific way for each borrower. 

This general case is amenable to computation with the MCMC methods outlined above, and the 
generalization is straightforward for: 

• Inhomogeneous portfolios 

i+* 

• State-dependent recovery rates such as 1 — RR = 

1+ 1 

• State-dependent recovery rates with a more general functional than 1— RR = —j 2 -, for example 
with non-linear terms in l, or with borrower specific coefficients 

• State-dependent recovery rates that depend not only on £, but on the states of individual 
borrowers, 1 ,, for the z-tli borrower 

• In general, MCMC allows to compute the probability distribution of any function whose 
domain is the state space, (f(h,l 2 , ■ ■ • ,In)} 
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In particular, we want to highlight another suggestive possibility: 1 — RR = a + blo, with Iq being 
the indicator of the central node in a Dandelion model. We have discussed above the Dandelion 
model introduces a relationship between macroeconomic risk factors and contagion, unifying both of 
them. In particular, macroeconomic risk factors could be understood as a specific, large Dandelion 
effect. 

As a consequence, a functional form of 1 — RR = a + bio would mean that in the ’’good” state 
of the economy, with Iq = 0, the loss given default would be ’’low” (a, say 20%). Instead, in the 
’’bad” state of the economy, with Iq = 1, the loss given default would be ’’high” (a + 6, say 70%). 
Again, such a modelling is amenable to MCMC calculations for the corresponding Jungle model. 


8. Contagion, macroeconomic risk factors and frailty 

In this section, we point out the three factors contributing to default clustering (macroeconomic 
risk factors, contagion and frailty) can be understood under the unifying framework of contagion. 

In particular, we show macroeconomic risk factors can be modelled as a particular case of con¬ 
tagion, generalizing the ” Dandelion trick”. 

Also, we motivate frailty can be interpreted as an instance of contagion: when suddenly a hid¬ 
den risk factor is revealed to the market, the effect is an abrupt and discontinuous change of the 
empirical parameters defining the credit portfolio, i.e. its probabilities of default and default corre¬ 
lations. Frailty can be thought of as the effect of this jump on the parameters of the loss probability 
distribution. 


8.1. Macroeconomic risk factors as contagion 

Earlier, we have seen the Dandelion model can be understood as a mixture of binomials, whereby 
the probability of default of a node in the periphery of the Dandelion can be decomposed as a 
mixture of ’’good” and ’’bad” probabilities of default, each corresponding to the two binomials in 
the mixture. 

Here, we want to show the ’’Dandelion trick” can be generalized for the Jungle case, to generate 
a mixture not only of binomial distributions, but of any distribution arising from an interacting 
model: 

Given a Jungle model of N credit instruments, defined by pi,i = 1 ,■■■ ,N, and pij,i = 
1 ,■■■ ,N,j > i, the corresponding probabilities of default and default correlations of its con¬ 
stituents, we can define a mixture model by: 

• For a fraction of time po , the probabilities of default of the N credit instruments are 

BAD . . . n BAD 
P 1 5 ) PN 

• For a fraction of time 1 — po, the probabilities of default of the N credit instruments are 

...GOOD (GOOD 

Pi i ■ ■ ■ i/hv 

By extending straightforwardly, but rather lengthily, the ’’Dandelion trick” for a general Jungle 
model, we can create mixture models of credit portfolios in a natural way. As a consequence, 
mixture models arising from the variation over time of macroeconomic risk factors can be embedded 
naturally into the framework of contagion. 


8.2. Frailty as contagion 

In this subsection, we want to motivate the use of contagion to explain, at least pictorially, the 
phenomenon of frailty: 

Frailty is described as the ’’Enron effect”: when ’’Enron” was considered to be a good company, 
its probability of default was low. However, some day the market discovered ’’Enron” had ’’cooked 
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its books”. Immediately, ’’Enron”’s probability of default sky-rocketed. 

But the impact of that discovery did not end there: analysts started wondering if other compa¬ 
nies, in other geographic regions and in other economic sectors, probably completely unrelated to 
’’Enron”, had been cooking their books, too. 

As a consequence, once the corresponding ’’hidden factor” was ’’revealed” to the market, the 
probabilities of default of many different companies throughout the ’’credit network” jumped up¬ 
wards, in analogy to what happened to ’’Enron”’s probability of default. This behaviour is not 
unlike asthma (pollution may trigger a common response among people suffering asthma; but 
asthma is not like flu, the prototype of direct contagion). 

The net result of the ’’Enron effect” was the probabilities of default of a multitude of nodes in 
the ’’credit network” increased suddenly. The reason was not direct contagion, but the revelation 
of new information to the market which had been hidden so far. 

We could argue then that dealing with frailty, in the contagion framework outlined by the Jungle 
model, is adamant to applying ’’what if’ scenario analysis to a given configuration of the ’’credit 
network”. By definition, we cannot know what the ’’hidden factors” are. However, we know that 
whatever they are, their effect once they are discovered by the market, is a sudden jump for some 
of the empirical pi and p t j. 

Frailty can then be understood as the change in the loss probability distribution of a Jungle 
model, when the empirical pi and pij transform into a set of stressed values throughout the network, 
which can be quantified by the derivative of the model parameters, a and (3 , with respect to the 
empirical parameters, pi and p t j. 

As a consequence, frailty and contagion are intimately related, since the effect of frailty will 
largely depend on the contagion structure of the credit network. 


9. Policy implications of contagion 


In Section 5, we have described how the Jungle model depends on the probabilities of default and 
default correlations of its constituents, plus the topology of the ’’contagion network”. 

We have seen that for several topologies, with not-too-unreasonable values for the probabilities 
of default and default correlations, the probability distributions of the credit losses become doubly 
peaked, out of credit avalanches triggered by contagion. 

In particular, we have analysed how increasing the default correlation for the Dandelion model, 
leads to the second peak moving to more extreme losses (more extreme domino effects), as well 
as the first peak moving towards zero losses. The inability of some credit portfolio models to 
accommodate these stylized facts, even for some models used in practice for regulatory purposes 


such as (Vasicek 1987), has been highlighted by (Kupiec 2009) 


In the following sections, we will describe the policy implications suggested by the effects above. 


9.1. The U.S. subprime and the European sovereign crises as quasi-phase 
transitions 

Both the U.S. subprime and the European sovereign crises caused sharp spikes in probabilities of 
default ’’across the board”. However, such increases were mostly concentrated in specific sectors of 
the economy (e.g., financial corporates, such as Lehman and AIG, during the U.S. subprime crisis). 

For the following argument, it will be important to highlight that not only the probabilities of 
default jumped during the financial crises, but also the (average pairwise) default correlations, as 
it can be seen from Figure 8. 

Before the crisis, market expectations implied low probabilities of default and default correlations 
for the overall economy, in particular the same held true for the financial sector. Also, risk aversion 
was probably ’’low”, and as a consequence, the risk premium associated to the market values of 
both empirical parameters was also ’’low”. 
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Figure 8. Historical default correlation for both iTraxx XOver and CDX IG 

However, when the crisis erupted, both probabilities of default and default correlations spiked up 
for the financial sector as a whole. Even more, if it were possible to define the sub-sector of the most 
speculative parts of the financial sector (those companies exposed the most to subprime assets), it 
seems reasonable to assume the probability of default of such sub-sector reached exceedingly high 
levels, of the order of magnitude of 50%. 

From the analysis of the Diamond model, we have seen quasi phase transitions arise naturally. 
And the Diamond model, with equally pairwise default correlations, is the corresponding Jungle 
model associated to a homogeneous portfolio whereby all nodes are connected to each other. And 
the sub-sector of the most speculative parts of the financial sector constitutes a credit portfolio 
which probably can be correctly approximated as homogeneous, with each node connected to any 
other node in the network. In fact, the standard methodology to imply default correlations from 
traded credit indices, such as CDX IG or iTraxx XOver, assumes all their individual components 
have the same default correlation with every other name in the index. 

Then, for a Diamond model, and under ’’normal” (non-crisis) conditions, the fundamental sit¬ 
uation of the economy is such that both the probabilities of default and the default correlations 
are ’’low”, resulting in an economy located around the bottom-left corner of Figure 7. However, in 
a crisis, and for that sub-sector of the economy exposed the most to the key fundamentals of the 
crisis, the model parameters a. and j3 may get closer to the ’’line of quasi phase transitions”, from 
below. As we have observed before, the ’’critical point” of such line corresponds to a probability of 
default of 44% and a default correlation of 11% for a reasonable set of parameters. 

As a consequence, if for such a sub-sector, the impact of the deterioration in the fundamen¬ 
tal situation of the economy implies that the corresponding probabilities of default and default 
correlations are close (from below) to 44% and 11%, resp., just a little bit of further economic 
deterioration may result in the model parameters, a and /?, crossing up the ’’line of quasi phase 
transitions” (from the bottom left corner, to the top right corner). Such a small change may be 
immaterial in the model parameters space, but in the empirical parameters space, the impact is 
huge: both the probability of default and the default correlation spike up, from low levels (close to 
0%) to high levels (close to 100%). 

The policy implication of the discussion above is then as follows: monitor closely the model 
parameters, a and /3, of the sectors of the economy most exposed to potential future systemic 
crises (the financial sector being always one of such sectors), and raise a red flag once and if the 
model parameters are getting closer to the ’’line of quasi phase transitions”, or at least, a reasonable 
stress test suggests the model parameters might cross such line in case of a sudden macroeconomic 
/ microeconomic shock. 
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In particular, let us highlight that such a policy would have resulted in a ’’red flag” for both the 
U.S. subprime and the European sovereign crises at the beginning of such crises, and even possibly 
a bit before their sudden eruption, as it can be seen intuitively from the default correlation figures 
above. 


9.2. Understanding the historical probability distributions of credit losses 


The historical default rates provided by (Moody’s Investors Service|2011 ) (also data from (Giesecke 


et al\2t)Tl) would allow us to reach similar conclusions), presented in Section 2, yield the following 


histograms: 





Figure 9. Histograms for data in (|Moody’s~Investors Service|2011) 


Front visual inspection for the three figures above, there are too few data points to robustly 
ascertain if the probability distributions for the default rates have one peak or more. Intuitively, it 
seems the tail is a fat one, with credit loss realizations up to 100%, in the case of Caa-C rating. 

The question is then if this evidence contradicts our claim that the fact that Maxent picks 
the Jungle model as its probability distribution of losses of choice suggests the Jungle model is 
a reasonable credit risk model to be considered in practice, since the Jungle model is often (for 
several topologies) doubly peaked, as for example the Dandelion model. 

The answer is that it does not. The historical distribution of losses presented above can be 
understood as follows: 

Let us assume, without loss of generality, that the empirical probabilities of default and default 
correlations only change once per year, on Jan 1st. Let us assume the corresponding topology 
gives rise to a Jungle model generating doubly-peaked probability distributions. Then, the losses 
that year will be a realization of that particular Jungle model. Probably, the realization will fall 
under the first peak. But the more years we repeat the same procedure (with their corresponding 
probabilities of default, default correlations and topology), the more likely is a realization occurs 
on the second peak (contagion effects, generating an avalanche / domino effect of credit defaults). 

From the Dandelion model, we have found out the position of the second peak is largely deter¬ 
mined by the default correlation (the probabilities of default also matter). 

As time passes by, we will have a series of realizations of the second peak. But importantly, the 
empirical data for each realization (probabilities of default, default correlations and topology) will 
most likely be different for each year, probably generating a double peak at different location on 
the axis of losses for each realization of the second peak. 

As a consequence, the historical probability distribution of losses will probably have only a first 
peak, consistent with the fact that in the majority of realizations, losses are basically contagion¬ 
less, so that first peak will be roughly similar to the one of a binomial model, but wider due to 
the mixing with different macroeconomic conditions over several business cycles, and a fat tail 
generated by realizations of the doubly-peaked probability distributions arising from the Jungle 
model. 

This way of thinking allows us to understand how is it possible the tail of the empirical probability 
distributions is so ’’thick”: the tail is generated through individual realizations of double peaks. 
This way of thinking relaxes the need to include extreme probability distributions which are able 
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to cope by themselves with the difficult task to model both extreme default events, and default 
events in a ’’good” economy state. 

Even more, the Jungle model allows us to understand a stylized fact of the probability distribution 
of losses for highly risky portfolios, exemplified by Moody’s Caa-C rating data: despite the fact 
Caa-C rating bonds are highly risky (and there is even one year where 100% of bonds in the sample 
defaulted), it also happens very often that Caa-C rating bonds enjoy a default rate close to 0% 
(on the sample, there are several years with 0% default rate). In fact, this phenomenon of 0% 
default rate happens more often for Caa-C than for bonds with a much better rating, which seems 
intuitively odd, see (Kupiec]|2009). 

However, the Dandelion model is able to explain this stylized fact: for Caa-C rating bonds, it 
seems likely that the individual bonds are described not only by high probabilities of default, but 
also by high default correlations among themselves (or with a central node, in a similar way to the 
Dandelion model; possibly banks or other financial suppliers specialized on risky lending). 

From the charts in the Dandelion model section, we can see that in this region of parameters, 
the higher the default correlation, the larger the losses for the double peak. But in addition to 
this effect, also the higher the default correlation, the lower the losses for the first peak. This is 
consistent with a contagion effect: contagion not only works on ’’bad” situations (a default in a 
node induces a default in another node nearby), but also on ’’good” situations (a non-default in a 
node induces a non-default in another node nearby). 

As a consequence, this framework of thinking leads us to suggest that the most relevant variable 
to ascertain default clustering is not the probability of default (as standard rating classifications 
appear implicitly to suggest) but default correlations. 


9.3. How should the Jungle model be used in practice? 

The Jungle model could be extended straightforwardly to introduce macroeconomic risk factors. 
The Jungle model is based under the assumption that the probabilities of default and default corre¬ 
lations are known fixed numbers. An analyst could include into the modelling a specific pattern of 
conditional default probabilities (and possibly conditional default correlations as well), in the same 
way mixture models introduce default correlations by mixing the binomial model (independent 
defaults) with a specific choice of conditional default probabilities. 

However, the interest to do so is limited. On one hand, the Jungle model does not need to follow 
the above procedure to include default correlations, since default correlations appear endogenously 
in the model. This is unlike the case for the binomial model, which needs a mixture in order to be 
able to model default clustering. On the other hand, averaging over a conditional default probability 
means the corresponding probability distribution of losses is the one for the whole business cycle. 

We believe a probability distribution of credit losses for a full business cycle is of limited interest, 
since a macroprudential regulator or a bank senior risk manager is interested mainly in extreme 
events. So, what matters for the probability distribution of losses is the short future, if and when 
a ’’bad” economic scenario is realized. 

As a consequence, the Jungle model fits better with a ’’what if’ strategy for model risk: ’’What 
would happen to my credit portfolio if my topology, probabilities of default and default correlations 
change?” The reason is the Jungle model is the most general probability distribution for credit 
losses (under the assumptions presented in the Introduction and later sections), so changing the 
parameters allow the analyst to span the whole space of credit models. 

By exploring how the probability distribution of credit losses for the Jungle model changes once 
the topology, the probabilities of default and the default correlations change (either by econometric 
equations, or by expert knowledge), an analyst could get a robust feel for credit tail risks. 



December 2, 2015 


MV19'cont'20150923 


9.4. It’s the correlations, stupid! 


Using a pictorial, non-rigorous example, the above discussion suggests an economy full of ’’industrial 
entrepreneurs”, understood as risky projects on a standalone basis, but lowly correlated among each 
other, will probably have a lower systemic risk than an economy full of ’’financial large corporates”, 
understood as corporates with low probabilities of default, but highly correlated among each other 
(possibly through a strong, common dependence on financial conditions). 

A credit portfolio of ’’industrial entrepreneurs” will probably have a high expected loss and a 
wide first peak (which would intuitively suggest a high risk portfolio), but probably will have a 
small or negligible second peak, since domino effects will probably not be present. 

Instead, a credit portfolio of ’’financial large corporates” will probably have a low expected loss 
and a narrow first peak (which would intuitively suggest a low risk portfolio), but probably will 
have a significant second peak, since domino effects might be relevant if the common, ’’financial” 
factor underlying the performance of all those large corporates suffers a severe, unexpected, ’’Black 
Swan” crisis, a la (Taleb 2007). 

As a consequence, measuring credit risk in a bottom-up fashion, for example assuming a given 
credit portfolio is ’’low risk” because its individual constituents are ’’low risk” (in the sense of 
having a good credit rating), disregarding collective effects at the portfolio level, may severely 
understate the systemic risks of a credit portfolio as a whole. 


9.5. ’’Too Big To Fail” banks 


Similarly, the discussion above suggests the ’’Too Big To Fail” (TBTF) phenomenon for large 
banks, a relevant issue for macroprudential regulators, can be understood within our contagion 
framework. In particular, a portfolio consisting of a TBTF bank and its borrowers can often be 
modelled as a Dandelion model. 

The Dandelion model results show the probability distribution of losses for the Dandelion suffers 
from a significant double peak even for non-extreme probabilities of default and default correla¬ 
tions’ estimates. This double peak may lead to contagion effects when the TBTF bank is close 
to bankruptcy, so the regulator may have the incentive to ’’bail out” the TBTF bank when the 
regulator realizes the total losses out of the TBTF bank default (implying massive defaults on its 
borrowers, due to domino effects out of contagion) are simply not socially acceptable. A TBTF 
bank knows the regulator knows that, so the incentive for the TBTF bank is to grow as much as it 
can, and to create as much contagion as it can. This is an example of social and economic fragility, 
as described in (Tal eb||2 012). 

A rational regulator should be able to understand these perverse incentives, and pre-emptively 
regulate banks’ size, with a penalty for large sizes. 

As described above, the onus of the argument for bank regulation is not to be found by analysing 
the TBTF bank on an individual basis, but understanding the effect is an emerging, global one. In 
other words, the ’’capitalist freedom” of a bank to grow as much as it can is interfering with the 
’’capitalist freedom” of other economic agents in the economy without a say on the banks’ actions, 
but potentially suffering the severe consequences of a systemic crises when and if the TBTF defaults 
and creates an avalanche of defaults through contagion (capitalism should be an incentive system 
where one is rewarded or punished by the result of one’s actions, not by the actions of somebody 
else). 

This phenomenon is clearly an externality, and it may require proper regulation to safeguard the 
interests of society as a whole. 






December 2, 2015 


MV19'cont'20150923 


10. Conclusions 

This paper presents and develops the Jungle model in a credit portfolio framework. The Jungle 
model generates a probability distribution for the losses of a credit portfolio with the following 
stylized facts: 

(i) The Jungle model is able to model contagion among borrowers 

(ii) The Jungle model endogenously generates doubly-peaked probability distributions for the 
credit losses, with the second peak consistent with avalanches / domino effects out of 
contagion 

(iii) The Jungle model endogenously generates quasi phase transitions, meaning small changes 
in the portfolio may result in sudden and unexpected systemic risks. The Jungle model 
helps us to ascertain the location and nature of those quasi phase transition points 

We study a series of particular cases of the Jungle model, in particular the Dandelion model and 
the Diamond model. 

The Dandelion model exemplifies the emergence of doubly-peaked probability distributions. The 
Diamond model quantifies how and when a quasi phase transition may occur for the Jungle model. 

Model risk arises from the genuine model uncertainty: potentially, there will be many different 
Jungle models consistent with the set of available empirical data for our portfolio. As a consequence, 
by considering the potential systemic risks of this ensemble of Jungle theories allows us to address 
model risk. In particular, we have shown that for not too unreasonable data, some Jungle models 
endogenously generate a quasi phase transition, i.e. given small changes in the underlying empirical 
parameters may induce sudden changes in the collective behaviour of the system, potentially and 
inadvertently generating systemic events. Quasi phase transitions and doubly-peaked probability 
distributions represent a challenge for model risk. 

We show the Jungle model is able to handle inhomogeneous portfolios and state-dependent 
recovery rates. 

We argue the Jungle model provides a unifying framework to think about the three factors 
contributing to default clustering (macroeconomic risk factors, contagion and frailty). 

The analysis of the Jungle model in general, and of the Dandelion and Diamond models in par¬ 
ticular, leads to some policy implications of contagion. We are able to understand qualitatively 
some empirical evidence, such as the embedding of the U.S. subprime and the European periph¬ 
eral crises into the general framework of quasi phase transitions, the thick tails in the historical 
probability distributions, as well as the surprising fact that quite often, the worst quality credit 
portfolios end up with default rates lower than the corresponding ones with a better rating. We 
also analyse the ’’Too Big To Fail” phenomenon under our framework based on contagion, and 
we pictorially compare systemic risks out of contagion for a ” financial economy of big corporates” 
versus an economy of ” industrial entrepreneurs”. 

We believe the study of the Jungle model in the credit arena, especially for regulatory purposes, 
deserves further attention. 
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Appendix A: The Maxent principle 

Maxent asserts: 

Given a finite state space 12, the probability distribution in 12 that maximizes the entropy and 
satisfies the following m < card(12) constraints, given m different functions in 17, fk(x), and to 
fixed numbers F^: 


ifkjx)) ■■= ^ P(x)f k (x) = F k , A: = 1,2, (Al) 

xEfl 


as well as a normalization condition: 


(1) :=^P(i) = l 

xEfl 


(A2) 


is: 


P{x) 


1 

Z(Xi, A 2 , • 



^ Xifi(x) 


i= 1 


where Z is called the partition function: 


(A3) 


Z(x 1, A 2 , • • - ,A m ) = »p ( ~ Xjfi' 


(A4) 


i= 1 


The Lagrange multipliers A* are found by inverting the following set of m equations: 


Fk = (fk(x )) = 


d logZ(Ai, A 2 , • • • , A m ) 
d\ k 


k = 1,2, •• • ,m 


Let us prove the claim above: 

Entropy is a function on 17 defined by: 


(A5) 


n 

S(pi,--- ,Pn) ■= ~^2pi\og(pi) (A6) 

2—1 


where we define n := card (17). 

As a consequence, we want to find the probability distribution pi, i = 1, • • • ,n such that S is 
maximum for that choice of pt over any other possible choice of Pi, under the constraints: 

N 

• Eft = 1 
2—1 

n 

• (fk( x )} = Y2 Pifk(xi) = Fk, k = 1, 2, • • • , m and Xi denotes the i-th state 

2=1 

First, we can find an extremum of S through the method of Lagrange multipliers, by imposing: 
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0 = 6 


n m n 

S - (A 0 - 1) E pi 

i=l k= 1 i=l 


= E 


ds_ 

dpi 


(Ao 


1 ) - '2Zhfk(xi) 

k =1 


6pi 


After some algebra, we find: 


(A7) 

(A8) 


Pi = exp 



'y \ ^kfk (a 


k =1 


(A9) 


By imposing the constraint that all probabilities must sum up to one, we obtain: 


exp (A 0 ) = E exp ( “ E 

By defining Z := exp (Aq), we obtain: 


(A10) 


i= 1 


k =1 


P(x) = 


1 


A 2 , • ■ ■ , ^m) 


exp -E 


(All) 


fc=i 


And the other constraints are satisfied as follows: 


F k = (fk(x )) = 


•Z(Al, A 2 , ■ • • , Am) ^ 


E/*(xi) 


B exp 


1=1 


an = -- 


9 log Z(Ai, A 2 j ' ' ' 1 A m) 


Now we only need to prove the above extremum is in fact a maximum. 


3A fc 


iv 


(A12) 


Let us assume we have a set of non-negative numbers pi, i = 1, • • • , n such that X Pi = 1- Let 

i =1 

us assume the numbers Ui satisfy the same conditions. 

It is easy to derive that log(x) < x — 1, for non-negative x. The equality only holds for x = 1. 
Then 


n 

5> log 
2=1 



2=1 




(A13) 


which is equivalent to 


n 

S(pi,--’ ,Pn) < E^ lo g 
2=1 


1 


Ui 


(A14) 
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with the equality only holding for u t = pi, i = 1 , • • • , n. 
Now, defining: 


Ui 


1 

Z(\ 1 , A 2 , • 


m 

—— y^ ex p (~ x kfk{xi)) 

z^m) , 


(A15) 


The inequality above for the entropy can be written as: 


n 

S(pi,-" 1 Pn) < Y Pl 
2= 1 


m 

logZ(Ai, A 2 , • • • , A m ) + ^ A kfkjxj) 

k= 1 


logZ(Ai, A 2 , • • • , A m )+y^A fc (/fc(a:i)} 

fc=i 


(A16) 

Now, let us consider all possible probability distributions pi , i = 1, ,n that satisfy the 
initial constraints. In the equation above, the right hand side is a constant for all these probability 
distributions. Instead, the left hand side (the entropy) changes when changing the probability 
distribution, and its maximum only is attained (i.e., the inequality becomes an equality) when 
Pi , i = 1 , • • • , n becomes the following distribution: 


Afc/fc(xj)^ (A17) 

Appendix B: The binomial model 

The homogeneous Jungle model with external field but no fiij'. 

P(Zi,/ 2 ,---,M = ^exp (Bl) 

has a partition function which can be calculated exactly: 

/ N \ 11 1 / N \ 1 1 1 

Z = exp I a E'< =EE-"E exp I a E'< -E exp (al\) yy exp (cd 2 ) • • • y^ exp (aim ) 

, Zjv V 2=1 / / i =0 / 2 —0 In = 0 V 2—1 / 1 1 —0 Z2 —0 Zjv =0 

(B2) 

= (^Y exp ( al ^J = + e<x ) N 

(B3) 

The explicit knowledge of the partition function allows us to compute any value for the theory. 

For example, we can compute the expected value of the loss, and see how the a parameter directly 
relates to the empirical p: 


Pi = 


Z(X 1 , A 2 , • • • , ^m) 


exp 


m 

E 

k= 1 


(£) 1 d log Z 

A ~ N da 


51og(l + e a ) 


1 


da 


1 + e~ a 


(B4) 
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or equivalently: 


e 


—a 


1 — p 
p 


(B5) 


Now we can compute the probability distribution of the losses, recalling a probability distribution 
can be computed by counting in how many ways the states can be arranged to produce the given 
underlying variable, divided by the partition function: 


p (^2 li = (S ] = \ exp 


S>2=1 


/ \ 

N 

a l 

liiht’” ^2 h = & 


We recall there are (^) ways to choose l elements out of N. Then: 


(B6) 


P 



1 

Z 



(B7) 


By substituting Z with its value above, and a 


by its dependence on p above, we find: 




(B8) 


Appendix C: Small contagion 

We have seen in Appendix B that the Jungle model: 


l ( N \ 

P(h,h, ■ •• ,In) = 5F ex P I + T Pijhlj 

\ i=1 / 


(Cl) 


becomes a binomial distribution when fi t j = 0, Vz, j. In other words, the Jungle model with no 




1 ( N 

Pp=o{h,h, ■ ■ ■ Jn) = v?exp ( »y^Ji 


2=1 


becomes the binomial distribution: 


(C2) 


N 


p[J2 l ^ =i 


\i= 1 


N 


£/-, „\N-e 


p ‘(1 -p) 


(C3) 
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with p = 


1 


1 + , e_ " 

Now, we are interested in the Jungle model with the same a as above, but also with f3 12 =: (3 
and Pij = 0, for ij 7 ^ 12. In other words, we are interested in expanding perturb at ively around the 
binomial model, in order to understand the effect of (3. The corresponding Jungle model is then: 


Pp(hM, ■■■ Jn) = 77 exp 


f N 

V i= 1 


+ phi 2 


The corresponding partition function can be calculated exactly: 


(C4) 


N 


^ = --E«p »E h + PI1I2 


l\ —O /2 — 0 In — 0 


i= 1 


11 / 1 

e yi ex p ( a ( ?i ++^^ 2 ) (x] 

Z 1= 0Z 2 =0 \Z=0 

(C5) 

= e 2a + 2 e" + l) (1 + e ")^ -2 

(C6) 



Rearranging terms: 


= (e' ! -l) e 2 “(l+e“) w - 2 +(l+e“)' v = 1 ^ + ^ (l+e“)' , +(l+e°) ff = (l + (1 ^ p _l )2 ) 

(07) 

The explicit knowledge of the partition function allows us to compute any value for the theory. 
For example, we can compute the expected value of the loss of l \: 


1 i 1 / N 

Pi = {h)p = ''' 2 li exp I a. "y ^ + /5ZiZ 2 


1 1 


i 1= oi 2 =o \i =0 

Substituting the partition function and rearranging: 


l\ —O/ 2 —0 In —0 

! x N -2 
al 

1 — 

z 


i =1 


|ee /1 exp (a(Z 1 + I 2 ) + Phh) E e “‘ = i (e“ + e 2 " +/3 ) (1 + e") 


\AT-2 


(C8) 

(C9) 


Pi = (*l >/8 = 


S a + e 2a+/3 ( e /3 -!) + (! + e “«) 


e /3 e 2a + 2 e« + 1 ( e /3 - 1 ) + (1 + e _ “) 2 


(CIO) 


Taking f3 small (keeping only up to linear terms in j3) and substituting 1 + e "by 


6 + --3— 

0 n \ ~ Pe =0 

Pi = (h)p ~ 


— = P/3=o | ^ ^2 ° ~P/3=o(l + ^=o)(l-/3p|=o) ~P/3=o(l+/3p/3=o(l-P/3=o)) 


/3 + p -fc 1 “ V, 0 

(Cll) 

As a consequence, pp is slightly larger than pp =0 , despite both sharing the same a, and the 
increase is proportional to (3. 
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For (3 not necessarily small, it still holds true that p@ is larger than pp=o, the only difference with 
the small j3 case is that the increase in probability of default is not proportional to (3 any more. 

We can then state that small contagion effects around the binomial model are akin to increasing 
the probability of default of the system. 

We also can see that, in addition to increase the probability of default, (3 induces correlations: 


(hh)p 


d log Z 
d/3 


d log Zp = o 
dfj 


d log 


( 1 + (IT^) 

dp 


0 + 


1 e? 

e^-l (1 + e~ a 

1 H- 

(1 + e~ a ) 2 


2 


(C12) 


Rearranging terms and using the dependence of pp=o with or. 


(hh)0 = — 


+ e P -l 

Pf>=0 


Pp=0~- 


1 + Pp=o(eP ~ 1 ) 

Let us finally check the correlation between 1 and 2 is proportional to p. From: 


(C13) 


0 = _ (hh)p - (h)p(l2)p _ 

V(ii)„(i - (hbWVMi - (fcM 


(C14) 


We only need to take care of the numerator to accomplish our goal. Let us take up to linear 
terms in (3 for 


(hi 2)0 ~ p}= 0 1 + 2 13 P 2 0 = 0 (1 + ff(l ~ v\= 0 )) (C15) 

1 P0=oP 

Then, up to linear terms in f3: 

(hh)p - (h) 0 ( 12)0 ~ p}= o(l + /5(1 - P 2 0= 0 )) - P 2 0= o(l + 2^=o(1 - P0= 0 )) (C16) 

= P(P 2 0= o(! - P 2 0= 0 ) - 2^ =0 (1 - P0= 0 )) = Pp}= o(l - P0= o) 2 (C17) 

As a consequence, the correlation is proportional to f3 with a positive coefficient. 

We have finally proven that for small contagion, the coefficient (3 can be interpreted as (a simple 
function of) the default correlation, in the same way that for no contagion, a can be interpreted 
as (a simple function of) the probability of default. 


Appendix D: The Dandelion model 

The Dandelion model corresponds to a Jungle model with N +1 borrowers, such that the first one, 
defined as i = 0 and considered to be at the centre of the Dandelion, is ’’connected” to all remaining 
borrowers, at the external surface of the Dandelion, such that j3 $j =: /? 7 ^ 0 for i = 1,2,--- ,N. 
Any other borrowers remain unconnected, f3 t] = 0 for i = 1,2,- •• , N & j > i. For simplicity, 
we assume a, =: a for i = 1,2, ■ - ■ , N. 

The probability distribution for the Dandelion model is: 
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if N N \ 

P(h,h, ■ ■ ■ ,In) = -g exp I a 0 l 0 + a'^k + ft ^Ipk J 

\ i =i i=i J 

The Dandelion model, despite being interacting, can be fully solved: 

111 / N N 

Z = XX-"X exp Wo + a^ + ^X^ 

Iq=0 1 1 —0 l iv —0 V 2=1 2=1 

= E-E exp fa X z< ) X ex P fao + ^X Zi> ) 


l\ — 0 l N — 0 


2=1 / lo= 0 


2=1 


Summing over /q : 


l l 


N 


N 


z = x ■ ■ ■ X ex p a X li ) 1 + ex p («o +p X l i) 

h= 0 l N =0 V i= 1 / V i=l / 

1 1 / JV \ 1 1 / 1VN 

X-X exp ( a X li ) + eao X ■ ■ ■ X exp ( a +^)X^ 


Zi—0 Ziv—o 


2=1 


h —0 l tv —0 


2=1 


(Dl) 


(D2) 

(D3) 


(D4) 

(D5) 


In analogy with the binomial case discussed in Appendix B, the sums above can be calculated 
explicitly, giving: 


Z = (1 + e°) N + e“°(l + e a+l3 ) N 


(D6) 


And then: 


log Z = N log (1 + e a ) + log 1 + e 


c*o 


1 + e a+p 

1 + e Q 


AD 


(D7) 


With an explicit knowledge of logZ, as a function of a, «o and f3, we can derive p, po and p as 
a function of a, «o and /3, as described in the Maxent description: 


Po = (lo) 


d log Z (q 0 , a, f3) 
da o 


N 

E< 4 > 

2=1 


N 

ok) 

i= 1 


1 d log Z(ao, a, f$) 
N da 

1 d log Z(ao, a, f3) 
N d~/3 


where q can be derived from the definition of default correlation: 
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P = 


_ q-ppo _ 

\ 4>(1 ~P)V'Po( 1 ~Po) 


(D8) 


After a bit of algebra: 


P o = 


l+e° 


\N 


P = 


1 + e“ 


l + e a °(e /3 - 1) 


1 + e “°(i+ e -^ 

(1 + e a+p ) N ~ l 
(1 + e a ) N + e"°(l + e a +P) N J 

q = po l + e- a ~P 


We have three equations, relating our three empirical estimates, p, po and p, with the three 
variables of the model, a, ao and p. After a bit of algebra, the relations above can be inverted 
explicitly: 


a 0 = (N - 1) log 


1 ~Po 
Po 


+ IV log 


po-q 


P = log 


a = log 

q 


i - po - p + q 
p-q 

i-po-p + q 
i-po-p + q 


p o - q 


p-q 


The probability distribution for the losses of the Dandelion model can then be calculated explic¬ 
itly: 


N 


i= 1 


Summing over Iq: 


/ N N 

^ exp a 0 ^o + a y li + p y l 0 li 

loji,’” -h In = £ ' 


i= 1 i= 1 


(D9) 


P (i = y^ ^ = 4 y, ex P (*V) + ex P (“o + Zip- + P)) 


1=1 


li-,'" Jn\Ii~\ - \~In—& 


There are (^) equal terms in the sum, finally giving: 


P = y ^ ^ (^j (exp (at) + exp (a 0 + + P))) 


(DIO) 


(Dll) 


where Z, a, ao and P depend on p, po and p as shown before. 
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Appendix E: The Diamond model 

The Diamond model is defined by: 


N 


Z = ^ exp I a Y l i + P Y ^ 


(El) 


Since 


N N N 


f 2 = T,i‘T, l i = T, l i + 2 T, l ‘ l i 


i —1 j =1 2=1 


i>j 


(E2) 


We can rewrite the partition function as: 


z= E 

1 1 5 ^2vj^JV 


(E3) 


Due to homogeneity, we can sum over states with the same with a degeneracy of (^: 


z= g(cM(H ) <+ ^ 2 


(E4) 


And the corresponding probability distribution for the losses will be: 


N 


p(e = Yji ) = 


2=1 


jV\ ex P ((« - fK + f^ 2 ) 


(E5) 


We can relate the empirical data, p and p to the model parameters a and /3 by: 


P = 


Q = 


T,m 

i>j 


N 

Si {k) = 1 aiog Z(a,(3) 
N N da 


2 8 log Z(a,/3) 


n(n-i) n(N- 1) d(3 


Then: 
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2 

q ~ ZN(N - 1) 



e=o v 7 








